Goal-driven active learning

نویسندگان

چکیده

Abstract Deep reinforcement learning methods have achieved significant successes in complex decision-making problems. In fact, they traditionally rely on well-designed extrinsic rewards, which limits their applicability to many real-world tasks where rewards are naturally sparse. While cloning behaviors provided by an expert is a promising approach the exploration problem, from fixed set of demonstrations may be impracticable due lack state coverage or distribution mismatch—when learner’s goal deviates demonstrated behaviors. Besides, we interested how reach wide range goals same demonstrations. this work propose novel goal-conditioned method that leverages very small sets goal-driven massively accelerate process. Crucially, introduce concept active query demonstrator only hard-to-learn and uncertain regions space. We further present strategy for prioritizing sampling disagreement between policy maximized. evaluate our variety benchmark environments Mujoco domain. Experimental results show outperforms prior imitation approaches most terms efficiency average scores.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Goal-Driven Learning

متن کامل

Learning as Goal-Driven Inference

types of learning activities. These are instantiated with domain-specific information in the context of the performance task to yield domain-specific learning goals which, in turn, are used to drive learning. 4 Goal Dependency Networks In general, an intelligent system will have multiple learning goals that are interrelated in a very complex manner. In order to reason about the interactions bet...

متن کامل

Towards Goal-Driven Reflective Learning

Except for various ad hoc (and sometimes quite successful) systems, this de facto manifesto calling for the study of introspective systems did not give rise to what may be called "a general architecture for declarative and/or reflective machine learning". Some recent research taking place under the label of "goal-driven learning" signals however a renewed interest in these very basic issues: " ...

متن کامل

Learning and Reusing Goal-Specific Policies for Goal-Driven Autonomy

In certain adversarial environments, reinforcement learning (RL) techniques require a prohibitively large number of episodes to learn a highperforming strategy for action selection. For example, Q-learning is particularly slow to learn a policy to win complex strategy games. We propose GRL, the first GDA system capable of learning and reusing goal-specific policies. GRL is a case-based goal-dri...

متن کامل

Integrated Learning for Goal-Driven Autonomy

Goal-driven autonomy (GDA) is a reflective model of goal reasoning that controls the focus of an agent’s planning activities by dynamically resolving unexpected discrepancies in the world state, which frequently arise when solving tasks in complex environments. GDA agents have performed well on such tasks by integrating methods for discrepancy recognition, explanation, goal formulation, and goa...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Autonomous Agents and Multi-Agent Systems

سال: 2021

ISSN: ['1387-2532', '1573-7454']

DOI: https://doi.org/10.1007/s10458-021-09527-5